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Abstract 

In this note we obtain tight bounds on the space-complexity of computing the ergodic mea¬ 
sure of a low-dimensional discrete-time dynamical system affected by Gaussian noise. If the 
scale of the noise is e, and the function describing the evolution of the system is not by itself 
a source of computational complexity, then the density function of the ergodic measure can be 
approximated within precision S in space polynomial in log I/e -I- log log 1 /S. We also show that 
this bound is tight up to polynomial factors. 

In the course of showing the above, we prove a result of independent interest in space- 
bounded computation: that it is possible to exponentiate an n by n matrix to an exponentially 
large power in space polylogarithmic in n. 


1 Introduction 

A discrete-time dynamical system is specified by a function / from a space X to itself. One of 
the most important problems in the study of dynamical systems is to understand the limiting or 
asymptotic behavior of such systems; in particular, the limiting distribution of the sequence of 
iterates x, f{x), /(/(x)),.... Combinations of such distributions give rise to invariant measures of 
the system, which describe the asymptotic behavior in statistical terms. The invariant measures 
are supported on invariant sets, which provide a topological description instead. Together, these 
invariant objects completely characterize the asymptotic behavior of the system. 

Ideally, given a dynamical system, we would like to be able to decide properties of its asymptotic 
behavior or to compute (to within some approximation) the invariant objects describing it. Unfor¬ 
tunately, in many cases, simple questions regarding this behavior are undecidable [Moo90, AMP95, 
W0IO2, KL09] and computing the relevant invariant objects is impossible [BY06, BY07, GHRll, 
BBRMll]. The general phenomenon behind these results is that, for many classes of dynamical 
systems, it is possible to ‘embed’ a Turing machine M in the dynamical system so that achieving 
the algorithmic task we are concerned with is equivalent to deciding whether M halts. 

In [BGR12], Braverman, Grigo, and Rojas showed that under the introduction of noise to a 
dynamical system (for almost all ‘natural’ noise functions), the set of invariant measures becomes 

*MB is supported in part by an NSF CAREER award (CCF-1149888), a Turing Centenary Fellowship, a Packard 
Fellowship in Science and Engineering, and the Simons Collaboration on Algorithms and Geometry. CR was par¬ 
tially supported by projects Fondecyt 1150222, DI-782-15/R Universidad Andres Bello and Basal PFB-03 CMM- 
Universidad de Chile. 
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computable. Moreover, in many cases, this set is computable efficiently. Specifically, they show 
(Theorem C in [BGR12]) that if the noise is Gaussian then there is a unique invariant measure 
/i; moreover, if / is polynomial-time integrable (convolutions of polynomials in / with polynomial 
fnnctions can be integrated in polynomial-time), then computing this invariant measure to within 
precision 6 can be done in time 0(poly(log 1/h)). 

The purpose of this paper is to investigate the space complexity of compnting the invariant 
measure of a noisy dynamical system. The algorithm given in Theorem C of [BGR12] for computing 
the invariant measure requires space 0(poly(e“^ log 5“^)). By applying (and developing) techniqnes 
for space-bounded computation, we prove (in Section 4) the following refinement of Theorem C that 
runs in space polylogarithmic of that of the original algorithm (albeit at a cost of a quasi-polynomial 
increase in the running time). 

An additional assumption that we need to make to obtain tight resnlts is that the function / 
itself is not a sonrce of additional space complexity. We say that / is 5-|-log-space integrable, if it 
is possible to integrate the convolution of powers of / with polynomial fnnctions with precision 
in space ©(S' -|- log log 1/C) (see Section 2 for a precise definition)^. 

Theorem 1. Let X = [0,1]. If the noise p)-( 3 ,)(-) is Gaussian, and f is {log ^)+log-space in¬ 
tegrable, then the computation of the invariant measure p, at precision 5 can be done in space 
O (poly (logi -blog log I)). 

We can also replace the assumption that / is (log i)-|-log-space integrable with the assumption 
that / is both log-space computable (i.e. that its values can be computed to within error f in space 
0(loglog 1/C)) and analytic with bounded Taylor series coefficients. In particular, we show that 

Theorem 2. Let X = [0,1]. If the noise p^^(^^^{-) is Gaussian, and f is log-spaee computable, 
smooth, and (for some r] > 0) satisfies \d^f{x)\ < for all x, then the computation of the 
invariant measure p, at preeision 6 can be done in space O (poly [logt] log ^ -|- log log j)). 

For the sake of simplicity, in this note we focus on the case where X = [0,1] (as in [BGR12]). 
Both Theorems 1 and 2, however, can be generalized to the case where X = [0,1]'^. For fixed d, 
the space bounds in Theorems 1 and 2 remain the same; for variable d, the space bounds gain an 
extra factor of poly(d). We explain this in further detail in Remark 12. 

In order to generalize Theorem C of [BGR12] and prove Theorems 1 and 2, we require a 
method to exponentiate n by n matrices np to powers potentially as large as 2P°h(”') in space 
polylogarithmic in n, (the traditional method of iterative squaring only works for powers up to 
poly(n)). To the best of our knowledge, there is no known existing solution to this problem that 
operates in polylogarithmic space. In Section 3, we present such a solution based on approximating 
M® via p{M) for some low degree polynomial p (Theorem 5). This theorem is arguably the main 
technical innovation of this paper: 

Theorem 5. Given ann by n matrix M whose entries are given up to precision 2“P°h(”') and an in¬ 
teger exponent E = 0(2P°h("')), there exists an algorithm that eomputes M® in space 0(poly(log n)) 
to within precision 2“P°h(”') i/||M®]| < 2"' (and otherwise reports that ||M^|| > 2^). 

Finally, in Section 5, we prove a corresponding lower bound, showing that this upper bound is 
tight; the space complexity of computing the invariant measure of such a system cannot be further 
reduced. 

^In fact, the conclusion of Theorem 1 follows even if these convolutions can be computed in space poly(S -I- 

loglogl/C). 
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Theorem 3. Any algorithm that can compute the invariant measure p, to within precision 6 of a dy¬ 
namical system with Gaussian noise kernel and analytic transition function f{x) (that uni¬ 

formly satisfies \d^f{x)\ < klrj^ for some rj = poly(e “^)) requires space at least ri (log ^ + log log j). 

These theorems provide evidence for the Space-Bounded Church-Turing thesis (SECT), in¬ 
troduced by the authors in [BRS15]. The SECT roughly states that a physical system with 
“memory” M is only capable of performing computation in the complexity class SPACE(M‘^(^)), 
where memory is a measure of the amount of information the system can preserve from one 
timestep to the next. For dynamical systems with Gaussian noise of variance e, one can show 
that M = 0(log -); the SECT thus suggests that such dynamical systems are limited to computa¬ 
tions in SPACE(poly log |), which is implied by Theorem 3. See Appendix A for more details. 

1.1 Open Problems 

In this paper we focus exclusively on the case where the noise is Gaussian. It is straightforward 
to adapt the proofs in this paper to other choices of noise functions. It remains unclear, however, 
how the space complexity of computing the invariant measures of / depends precisely on the noise 
function. More specifically, we would like to be able to answer the following problem. 

Problem 4. Can we associate with every random perturbation a value M so that computing the 
invariant measure of a dynamical system with this noise can he done in space 0(poly(logM -|- 
log log 1/(5)), and moreover that this is tight: given a random perturbation with value M, there is 
some function f whose invariant measures subject to this random perturbation take space P(poly (log M-|- 
log log 1/(5))? 

For the case when the random perturbation is Gaussian with variance e^, this note shows that 
it suffices to take M = (or, in the (i-dimensional case, M = 

Acknowledgments 

We would like to thank Eric Allender for his advice on space-bounded computation. 

2 Preliminaries 

2.1 Discrete-time dynamical systems 

We begin by giving a brief description of the relevant aspects of the theory of discrete time dynamical 
systems, largely following the notation of [BGR12]. For a complete treatment see for instance 
[Wal82, Pet83, Mafi87]. 

A dynamical system is a metric space X representing the set of possible states along with a 
map / : A —)• A representing the transitions between states. Given an initial state x G A of 
the system, the trajectory of x is the sequence {x, f{x), f{f{x)), ... }. To avoid certain technical 
pathologies that can arise, throughout the course of this paper we will assume that A is a compact 
Lebesgue-measurable subset of and the function / is continuous. 

Given a probability measure p. over A, we can define the pushforward of pL under / via {ffj,){A) = 
n{f~^{A)) for all events A C X. A probability measure p is invariant for the dynamical system if 
fh = t^- 
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In this note, we focus on the case of dynamical systems with noise. Denote by P{X) the set of 
Borel probability measures over X under the weak convergence topology. A random perturbation S 
of f is given by a family {Qxjxex S P{X) of probability measures over X for each point in x which 
each represent the ‘noise’ at that point. Then, instead of a deterministic trajectory, S induces a 
Markov chain over X, where Pr[xi+i G A] = for all Borel sets A (Z X. Likewise, the 

pushforward of a probability measure p, G P{X) under S is defined by {Sp){A) = f-^ Qp^'){A)dp. 
As before, p is an invariant measure of the random perturbation S of f if Sp = p. 

For simplicity, throughout this paper we will assume that the domain X is the d-dimensional 
cube [0,1]'’* (and for the majority of the discussion, we will focus on the case where d equals 1). 
Moreover, in all of our examples we will be concerned with the case of Gaussian noise with variance 
2e^, where the measure Qx is defined (in the case d = 1) by the probability density function 

Ke{y, x) = Ce{x)—^ exp(-(y - xf I2e^) 
ev 27r 

where C^{x) is a normalization factor so that K^{y,x) has measure 1 over [0,1]; specifically, C^{x) 
is given by 


Ce{x) = ( I exp(-(y - xf/2e^)dy 

\Jo ev27r 

Note that if p{x) is the density function of a probability measure on [0,1], then the density p = Sp 
of the pushforward measure under S is given by 

p{x)=[ p{y)K^{f{y),x)dy 
Jo 

For this reason (following the notation of [BGR12]), we will write Kf{y,x) as shorthand for 
Keifiy),x). We will also write denote the family Qf(x) of probability measures for this 

dynamical system with noise (i.e. the probability measure induced by K^{y,f{x))). 

2.2 Space-bounded computation 

The space complexity classes we consider in this paper are very small; they are (poly)logarithmic in 
the size of the output. To this end, we review some classic results from space-bounded computation. 

A function / is log-spaee computable if it can be computed by a Turing machine with a read-only 
input tape, a one-way write-only output tape, and a read-write work tape of size O(logn). The 
following functions are known to be log-space computable: 

(a) The composition of a constant number of log-space functions. The composition of two functions 
f{g{x)) can be performed by dividing the work tape into two tapes of size O(logn), and using 
the second tape to compute the desired bit of g{x) whenever it is required for f{g{x)). By 
induction, this can be extended to any constant-depth composition of log-space functions. 

(b) Addition of poly(n) n-bit integers. This can be done with via the standard grade-school addition 
algorithm (with some attention paid to how to represent carries). 

(c) Multiplication of two n-bit integers. This follows from b via the standard algorithm for long 
multiplication. 
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(d) Multiplication of two n by re matrices, each of whose entries is an re-bit integer. This follows 
from b and c (each entry is the sum of re products of two re-bit integers). 

(e) Division of two re-bit integers. This result is due to Chiu, Davida, and Litow [CDL95]. The 
main idea of their proof is to represent both numbers in terms of their values modulo various 
small primes, perform the arithmetic operations modulo these small primes, and reconstruct 
the result via the Chinese Remainder Theorem. 

(f) Multiplication of poly(re) re-bit integers. This can be done via the same technique of Chinese 
Remainder representation described in e and is also described in [CDL95]. 

(g) Arithmetic operations on real numbers up to precision This follows from the preceding 

results (we need only additionally keep track of the location of the decimal/binary point, which 
requires at most a logarithmic amount of extra space). 

(h) Computation of factorials and binomial coefficients. This follows from e and f. 

(i) Taking products, powers (with exponents of size poly(re)), and compositions of polynomials 
with degree poly (re) and coefficients of size poly(re). This follows from h, f, and b. 

(j) Computing exp, log, and arctan of numbers to within precision This was originally 

shown by Alt in [Alt84] (in all cases it suffices to approximate these functions via some snffi- 
ciently long prefix of their Taylor series). 

(k) Computing to within precision 2“^°*^^”'^ where x is a real number provided to precision 
2“P°h("-) and E is a poly(re)-bit integer. Again, this was shown by Alt in [Alt84] and essentially 
follows from j by writing x® = exp(iMogx). For completeness, we provide a derivation of this 
fact in Appendix B. 

There are some operations which, while we do not know how to perform in a logarithmic amount 

of space, we do know how to perform in a polylogarithmic amount of space. These include: 

(l) Computing the composition of logarithmically many log-space functions. By similar logic as a 
above, this can be done in space 0(log^ re). 

(m) Computing to within precision 2“^°*^^”), where and M is an re-by-re matrix of poly(re)- 

bit entries. This can be done in space O(log^re) via repeated squaring (this is essentially the 
logic behind Savitch’s theorem, see [Sav70]). 

(n) Compnting the determinant (and more generally, the coefficients of the characteristic poly¬ 
nomial) of an re-by-re matrix M with poly(re)-bit integer entries. This can be done in space 
O(log^re) via a result of Buntrock, Damm, Hertrampf, and Meinel (see [BDHM92]). 

(o) Inverting an re-by-re matrix M with poly(re)-bit integer entries. This follows from n by express¬ 
ing the inverse of M in terms of the determinant of M and cofactor matrix of M. 

(p) Computing all roots of a polynomial of degree re with poly(re)-bit integer coefficients to within 
precision 2 “P°h(’^), This follows from a result of Neff and Reif; their algorithm uses space 
O(log’^re) (see [NR96]). 
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(q) Computing the eigenvalues of an n-by-n matrix M with poly(re)-bit integer entries to within 
precision This follows from p and n by computing the roots of the characteristic 

polynomial of M. 

It should be noted that many of these operations, when restricted to polylogarithmic space, 
require (to the best of our knowledge) superpolynomial running times. In particular, the above 
algorithm for matrix exponentiation (and more generally, Savitch’s algorithm for STCONN) re¬ 
quires time 0(2^°® ”). It is open whether every function computable in polylogarithmic space can 
be computed simultaneously in polylogarithmic space and polynomial time. We therefore cannot 
ensure the same time bound as in the original statement of Theorem C in [BGRI2]. 

2.3 Real computation 

Throughout the rest of the paper (and particularly in the next section) we will often have to work 
with binary representations of real numbers. We summarize in this section some common notation 
we use in the remainder of the paper. 

A real number x is given up to precision 2“"''' if x is given as an integer multiple of 2“”'". We 
say X is given up to precision 2 “P°h(»^) jg given up to precision 2“"'° for some c. We further 
assume that all numbers given this way are also bounded above in magnitude by 

We say we can compute a function f{x) up to precision 6 if there is an algorithm which, 
when provided with x up to a sufficiently high precision, computes a dyadic number x' such that 
\x' — f{x)\ < 6. We say we can compute a function f(x) up to precision in polylogarithmic 

(alternatively, logarithmic) space if, for each positive integer c, we can compute f(x) up to preci¬ 
sion 2“” in space 0(poly(logn)) (alternatively, O(logn)), where the degree of the polynomial is 
independent of c. 

In the statement of Theorem 1, we require that the function f(x) is (loge“^) -|- log-space in- 
tegrable. Formally, a function / : M —)■ M is 5 -|- log-space integrable, if, given an interval [a, b] 
(with a and b both given up to precision 2“P°^^^"'^) and a polynomial p{x) of degree poly(n) whose 
coefficients are all given to precision 2“P°^y("'\ it is possible to compute the integral f {x)p{x)dx 
to within precision 2 “P°h(’T’) space 0{S + logn). In the higher dimensional case where / is a 
function from to the interval [a, 6] is replaced by the box [ai, 6i] x • • • x [a^, 6^]. 

Finally, we define what we mean by the computation of an invariant measure of a dynamical 
system. We say a measure p! agrees with a measure p up to precision 5 if the total variation 
distance between p, and p' is at most 5. If measures p and p' are given by density functions, we will 
write \ \p — p'Woo to denote the L^o distance between the two density functions; note that since the 
size of our domain is normalized to 1, if \\p — p'\\oo < <5) then the total variation distance between 
p and p' is also at most 5. We say we can compute a measure p in space 0{S) if, for any interval 
[o, b], we can approximate the weight of p over [a, 6] to within precision 2“” in space 0{S + logn) 
(again, in the d-dimensional case, we replace the interval [o, b] with the box [ai, 6i] x • • • x [ad, 6(i]). 

3 Exponentiating matrices to large powers 

Iterative squaring allows us to compute powers of n-bit matrices up to exponents that are poly¬ 
nomial in n in polylogarithmic space. Proving Theorem 1, however, requires us to be able to 
exponentiate numbers and matrices up to exponents of size potentially exponential in n. 
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In this section we demonstrate how to raise matrices to exponentiaiiy iarge exponents using a 
poiyiogarithmic amount of space. In particular, we prove the following theorem. 

(Throughout this section, we take the norm ||M|| of a matrix M to be the maximum norm, i.e. 
the maximum absolute value of an entry of M). 

Theorem 5. Given ann by n matrix M whose entries are given up to precision 2“P°h(”') and an in¬ 
teger exponent E = 0(2P°h("-)), there exists an algorithm that computes in space 0(poIy(Iog n)) 
to within precision i/||M®|| < 2"' (and otherwise reports that ||M®|| > 2”j. 

Our general approach will be to construct a polynomial p{x) of degree at most n such that, for 
each eigenvalue A of M, p{X) « A®. It will then follow that p{M) « M®. 

We first show that we can reduce Theorem 5 to the case where M is diagonaiizabie with n 
distinct eigenvalues. 

Theorem 6. Given any n by n matrix M whose entries are given up to precision 2“P°h(”), an 
integer exponent E < 2P°*y("’) (that satisfies ||M^|| < 2^) and a precision 5 = 0(2“P°'y^”)), there 
exists an algorithm that computes in space 0(poIy(Iog n)) a matrix Mq with entries provided to 
precision 2“P°h(’^) such that Mq has n distinct eigenvalues and \ \M^ — Mq\\ < 6. 

Proof. Let D be the diagonal matrix diag(l, 2, 3,... , n), and set 

M{t) = M(1 -t) + Dt 

Let p{t) be the discriminant of the characteristic polynomial of the matrix M{t)] that is, if Xfit) 
are the roots of the characteristic polynomial of M{t), then 

i<j 

It is known that the discriminant of a polynomial P{x) of degree d can be computed as the 
determinant of a {2d — 1) by {2d — 1) matrix whose entries are coefficients of P{x) (see for instance 
[GKZ94]). Since the coefficients of the characteristic polynomial matrix are in turn polynomials in 
the entries of M, it follows that p{t) is a polynomial in t. Moreover, by equation 1, scaling a matrix 
by some multiplicative factor c multiplies the discriminant of the characteristic polynomial of this 
matrix by a factor of it follows that the discriminant of the characteristic polynomial of a 

matrix is a homogeneous polynomial of degree n{n — 1) in the entries of the matrix, and therefore 
p{t) has degree at most n{n — 1). Finally, since we can compute determinants and characteristic 
polynomials of matrices in poiyiogarithmic space (by remark n in Section 2.2), we can compute 
p{t) in poiyiogarithmic space. 

Note that since M(l) = D, it follows that p{l) = Y\i^j{i — j)‘^ 0, and therefore that p{t) is 

not identically 0. Now, let to be the largest power of 2 satisfying 

in = 2“'^° < _-_ 

^ - 100n(re - 1)2^E‘^\\D - M\\ 

and consider the n(n — 1) + 1 values t = fcto where k ranges from 0 to n{n — 1) inclusive. Since p{t) 
is a polynomial of degree n(n — 1) that is not identically 0, it can have at most n(n — 1) roots, so 
for at least one of these choices of k, p{t) / 0. Since p{t) is non-zero, no two eigenvalues of M{t) 
are equal. On the other hand, for this value of t, note that 
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(^ + {D- M) 


5k 


E 


100n(n - 1)2”£;2||Z) - M|| 


5k{D - M) 




100n(re-l)2’"||D-M|| 


< 


5 

Too 


It therefore suffices to take Mq = M{t). Since to = 2 the entries of Mq are all given to 

precision as desired. 

□ 

We next cite the following technical lemma about the minimum distance between distinct eigen¬ 
values of M. 

Lemma 7. Let p{x) be a degree n polynomial whose coefficients are integers all with absolute value 
at most A. Then for any two distinet roots ri / rj of p{x), 


fi ~ fj\ > 2,nA 


( 2 ) 


Proof. See [ColOl]. 


□ 


Corollary 8 . Let M be an n by n matrix whose entries are provided to precision 2 and 

are at most 2 P°*y(”') in absolute value. Then for any two distinet eigenvalues A* 7 ^ Aj of M, 
|Ai - Aji > 2-p°iy(”). 




and hence 


|Ai - Aji > 


□ 


Finally, we prove the following lemma bounding the size of the matrices related to the eigende- 
composition of a matrix M. 


Lemma 9. Let M be an n by n non-singular matrix with distinct eigenvalues whose entries are 
provided to preeision 2“P°^y(”'), and let D' he a diagonal matrix all of whose diagonal entries have 
absolute value at most 1. Then if we write M = U~^DU, the matrix M' = Lf~^D'U has entries at 
most 2 P°^y(”). 
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Proof. Let Ai, A 2 ,..., be the eigenvalues of M (i.e. the diagonal entries of D), and let /xi, /X 2 , • • • 
be the diagonal entries of D'. Consider the polynomial p{x) of degree at most n — 1 which maps A, 
to Pi for each i. By the Lagrange interpolation theorem, we can write p{x) as 




Pi 


{x 


i=l j^i 


(Xi 


M 

A,) 


By Corollary 8, for all i / j, |Aj — Xj\ > Combining this with the fact that \pi\ < 1 

implies that all coefficients of p{x) are at most 2 P°h("-) absolute value. 

Consider now the matrix p{M). Note that since p{D) = D', p{M) = M'. But since all the 
entries of M are at most 2P°h("'), the entries of p{M) will be at most 2P°h("'), and hence the entries 
of M' are at most 2P°^y(’^\ 

□ 


We now proceed to prove Theorem 5. 

Proof of Theorem 5. By Theorem 6 we can assume without loss of generality that M is diagonal- 
izable with distinct eigenvalues. We begin by finding the eigenvalues of M. By remark q of Section 
2.2, it is possible in polylogarithmic space to compute the eigenvalues of M to within any precision 

2-poly(n)^ 

Let Ai, A 2 ,..., An be the eigenvalues of M. For each A*, let A* be our approximation to Aj (so 

that |Ai — Ai| < 2“P°^y("') for some choice of poly(n)). We now construct via Lagrange interpolation 

~ E 

the polynomial p{x) such that for each i, p{Xi) = A* (note that by Theorem 26, we can compute 
Aj to within any precision 2“P°*y(”) in logarithmic space). We wish to show that we can ensure 
(via approximating the roots with fine enough precision) that |p(Aj) — A^l < 2“P°h(’^) for any given 
choice of precision 2“P°'y^”). 

To show this, hrst note that the Lagrange interpolation formula says that we can write p{x) as 


p{x )= 

1=1 j^i 


jx - Xj) 
(A'. - A,) 


Recall that, by Corollary 8, for all f / j, | Aj — Aj| > 2 for some constant a. In addition, Xf 
is at most ||M®|| which by our assumption is at most 2”. Hence, all the coefficients of p{x) have 
magnitude at most 2"’ (2“"'“) < 2”“^^. 

Next, note that if p{x) is a polynomial of degree d all of whose coefficients are at most A in 
absolute value, then 


\p{x)-p{y)\ < 


i=0 


x^\x-y\Yl 


2=1 


2—1 

j=0 


-3 


< (i^2lmax(|x|, lyl)'^ ^|x —y| 


Since |Ai — Ai| < 2 for some constant h and |Ai|'^ ^ < |Ai|^ < 2"', then it follows that. 


(3) 

(4) 

(5) 
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Therefore, as long as we choose b > a + 2, |p(Aj) — p{Xi)\ will be at most Since 

p{Xi) = \f, and since \Xf — A^l « T'|Ai — Ajj < 2“"’*’|Ai — Ajj, it follows that 

\p{Xi) - Af I < + E2-^' 

Since E < E < 2”'" for some c. For any c', choosing b = c+c' ensures that |p(Ai) —Af^| < 

2 “"’ , as desired. 

Finally, consider the matrix p{M). We claim that each entry of p{M) — has absolute value 
at most To see this, note that if we diagonalize M as M = U~^DU, where D is a diagonal 

matrix containing the eigenvalues of M, then p{M) — = U~^{p{D) — D^)U. Each diagonal 

entry of p{D) — is of the form p{Xi) — Xf and therefore by the above discussion has magnitude at 
most 2“"''' , for any c' of our choosing. Rewriting p(M) — M^ in the form 2“”''" [/~^2’^‘' {p{D) — D^)U 
and applying Lemma 9, it follows that (for sufficiently large d) each entry of p{M) — also has 
magnitude 2“^°'^^”^. 

It therefore suffices to compute p{M). Since we can compute the coefficients of the polynomial 
p in polylogarithmic space and since we can compute for any k < n in polylogarithmic space 
via repeated squaring, we can compute p{M) in polylogarithmic space, as desired. □ 

4 Computing invariant measures in small space 

In this section we prove Theorem 1. 

This theorem can be seen as a refinement of Theorem C in [BGR12]. Our strategy, therefore, 
will be mainly to adapt the algorithm described in the proof of Theorem C, taking care to implement 
each step in polylogarithmic space. 

For completeness, we will first describe the algorithm presented in [BGR12]. We defer the 
analysis of this algorithm to the original paper. 

Recall that Theorem C states 

Theorem 10. Let Sg be a computable dynamical system defined by a continuous function f from a 
compact space M to itself and a Gaussian noise kernel Assume also that f is polynomial¬ 

time integrable (i.e. it is possible to integrate the convolution of powers of f with polynomial 
functions in polynomial time). Then computing p to within precision 5 < 0(e) requires time and 
space Os,e(poly(log(l/(i))). 

The algorithm used in the proof of Theorem C proceeds as follows. 

1. Begin by partitioning M into A regions each with diameter at most e. Assign each atom 
a center Xj G a*. 

2. Let pd)(x) be the probability density function of the system at time t (given some arbitrary 
initial distribution (x))- Then on each of the regions a,, p^^'^ (x) can be written as a Taylor 
series in (x — xfi. In particular, we have that 
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A oo 

^W(a;) = ^ l{x G o.i]^pfj^{x - Xif 
i=l k=0 

where pi^k £ are the coefficients of these Taylor series. The coefficients at time t + 1 are 
related to the coefficients at time t via the following linear map. 




(y - xjr 


diKf{y,Xi) 


Call this linear map P. The coefficients of P can then be computed to arbitrary precision by 
computing convolutions of derivatives of the noise kernel with certain polynomials (which is 
possible in polynomial time by our assumption). 

3. For any positive integer N, ignoring all terms in the Taylor expansion of degree larger than 
N truncates the transition map P to form a finite linear map Pn (representable as an AN 
by AN matrix). The analysis in [BGR12] proves the following lemma. 

Lemma 11. There exist log-space computable functions t{5) and N(d) such that 


for all (5 > 0, uniformly in p, where 


t{S) = 0(log(i ^ exp (e ^)) 

N{d) = O (log (5"^ poly (e"^)) 

Proof. See Theorem 36 in [BGR12]. Explicit expressions for t{6) and N(d) can be found in 
the proof of Theorem 36. □ 

By repeated squaring, we can compute in time 0(poly(A^(5)) log t{6)) = Oe(poly(log 5~^)). 

The above lemma implies that the measure given by is within 6 of the invariant measure 
p, as desired. 

We now proceed to prove Theorem 1. As in Theorem C in [BGR12], we initially restrict 
ourselves to the one-dimensional case for clarity. We later describe the changes necessary for the 
d-dimensional case. 

Theorem 1. Let X = [0,1]. If the noise Pj-( 3 ;)(') Gaussian, and f is {log ^)+log-space in- 
tegrable, then the computation of the invariant measure p at precision 5 can be done in space 
0 (poly (log i-h log log I)). 
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Proof. We describe how to adapt the algorithm presented above so that it can be performed in 
poly-logarithmic space. 

In order to show we can execute the above approach in polylogarithmic space, we must show we 
can both compute the coefficients of the matrix P to within poly(h) accuracy and that we can then 
subsequently exponentiate the truncated matrix P]\f(^s) to the power t{6). Note that the coefficients 
of P are given by the expression 





In the case of a Gaussian kernel, 


Kf{y, Xi) = exp - Xif 

ev 

We can expand this expression out via the Taylor series for exp(x). Since (/(y) — xf) is bounded 
(by the diameter of M, for example), to approximate this integral to within <5, it suffices to take 
the first poly (^ -|-log terms of this expansion. We can therefore approximate P^'^’^\l,m) as a 
linear combination of poly (7 -|- log 7 ) terms of the form 



xjrfiy)^dy 


By our assumption, we can evaluate each of these integrals (to within precision poly((5)) in 
space ©(loglog^). The coefficients of the linear combination can also each be computed in space 
complexity O (poly (log 7 -|- log log 7 )) via the comments in Section 2.2 (in particular, h and i), and 
hence the entire linear combination can be computed in this space complexity. The normalization 
constant Ce(xj) can similarly be computed in this space complexity by expanding out exp(y — 
as a Taylor series in y and integrating over [0,1]. 

Finally, we must compute Note that since t{d) is exponential in e“^, we cannot compute 

in space O(poly(log log -|- loge“^)) via repeated squaring, as in the proof of Theorem C. 

Instead, we apply the algorithm presented in Section 3; by Theorem 5, this allows us to compute 
to within precision 6 in polylogarithmic space. 

Since is an AN{6) by AN{6) matrix, where A = 0(poly(l/e)) and N((5) = 0(poly(log(l/(5)-|- 
1 /e)), it follows that computing P^^]) can be done in total space complexity 


O (poly (log TN((5))) = O 



log i -h log log i 


□ 


Remark 12. To extend this result to the case of d dimensions, we can follow essentially the same 
procedure; the only change is that we now must write the density functions yd) (^x) as multivariate 
Taylor series in (x — xj) (and each of the components of / must be (log e~^) + log-space integrable). 
Since the number of terms in the multivariate Taylor expansion of degree at most N is 0{N'^), 
the truncated matrix P^r still has size polynomial in 7 and log 7 , so the invariant measure can be 
computed in space complexity 
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( 6 ) 


poly 


d + log - + log log \ 
e 0 


4.1 Computing Taylor coefficients of / 

Theorem 1 relies on the assumption that convolutions of powers of / with polynomials are log-space 
integrable. While this assumption holds true for many natural choices of /, it is perhaps not the 
easiest condition to work with, and one might hope for a more natural constraint on /. In this 
section, we show an alternate constraint which implies our previous assumption; namely, that / 
is log-space computable, smooth, and has bounded Taylor coefficients. Recall that / is logspace 
computable if given x on the input tape, f{x) can be computed within precision 2“"' using space 
O(logn). We prove the following theorem^. 

Theorem 13. Let f be a function that is log-space computable, smooth, and for some constant rj, 
satisfies (for all x) 


\d'^fix)\ < klrj^ 

Then it is possible to compute integrals of the form 


[ {y-x^rf{yfdy 

Jaj 

where diamOj < -^ to within precision S in space logarithmic in m, k, logy and log 1/5. 

Remark 14. We note that if / is analytic in [a,b], then such a constant rj always exists. In fact, 
if we let p to be a strict lower bound of the set of all the radii of convergence of the Taylor series 
with centers in [a,b] (note that p > 0 by compactness), then the integral Cauchy formula implies 
that, for any x G [a, b] 


1^ Mkl 

d'^fix) < 

pK 

where M is any upper bound of / over [a, b]f’ = {z G C : \z — x\ < p for some x G [a, b]}. 

To prove the above theorem, we first show that if / satisfies the above constraints, then it is 
possible to compute its Taylor coefficients in logarithmic space. 


Lemma 15. Assume f is log-space computable, smooth, and satisfies \d^f{x)\ < klrf^ for all x in 
the domain. Then for any Xc, we can write 


fix) = '^CLkix - Xc)^ 
k 

The value of Ok is then computable to within precision 5 in space logarithmic in k, logl/p, and 
log 1/5. 

similar theorem holds under the assumption that / is computable in poly logarithmic space. The conclusion 
is then that the integrals are also computable in polylogarithmic space — which suffices to obtain the conclusion of 
Theorem 1. 
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Proof. We claim that if we choose t = dr] then 


kW^ 




< <5 


(7) 


Note that since / is computable in log-space, the quantity on the LHS of equation 7 is com¬ 
putable in space O (log k + log log 1/r) = O (log k + log log rj + log log 1/(5), as desired. 

To prove equation 7, recall that the Lagrange remainder theorem for Taylor series says that for 
any x (within the radius of convergence of the Taylor series about Xc), we can write 


fix) = 


+ \k +1)? 


\i=0 


for some ^ between Xc and x. Write f{x) = {Yf!i=QCLi{x — Xc)*^ -|- i7A;-i-i(x). By our constraint on 
/, we know that 


(fc-Hl)! 


< 7/^+^, SO we can rewrite this as 

\Rk+iix)\ < T]'"^^{x - Xc)^^^ 


Next, recall the following binomial identities. For all r < k, we have that 

^ ^k 

i 


E=’-(-T- J=o 


i=0 

On the other hand, when r = k, we have that 

k 




i=Q 


Substituting in the Taylor expansion for / and applying the above binomial identities, we see 


that 





T,i=0^k+lix + iT)i-l)’^ *(^) 

kW^ 



klr^ 


< 

< 

< 

< 


kW^ 

TT]' 


1 ^ 

CeE 

1=0 

fc-l-1 k 

kl 


.^k-\-l 



as desired. 

We can now prove Theorem 13. 


□ 
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Proof of Theorem 13. We wish to compute the integral 



-x^rf{yfdy 


where we know that diamaj < Write f{y) = ^ ai{y — Xjf] by our assumption |ai| <rf for all 

i. 

Let fM{y) = ^i{y ~ ^jY- Then we have that 


CX> 


\f{y)-fM{y)\ = 

ai{y-XjY 

00 

< 

^ \ai\-\y-Xj 


=M +1 

< 



2 = M + 1 


= E 

i=M+l 
= 2-^ 


Since f{y) € [0,1], this further implies that \ f{y)^ — fM{y)^\ < k2 it follows that if we take 
M = log(5/A;), then \f{yY — fM{y)^\ < d, and in particular 



Xj)'^fM{y)^dy 


< <5 


But note that by Lemma 15, we can compute each of the coefficients of fniy) (to within 
precision poly(5)) in space logarithmic in M, logl/?], and log 1/5. We can then compute the 
coefficients of {y — xj)"^fM{y)^ via remark i of Section 2.2, and hence compute the integral over aj 
in space O (log k + log m + log log rj + log log 1/5), as desired. □ 


Theorem 2 now follows as a straightforward corollary to Theorem 13. 

Theorem 2. Let X = [0,1]. If the noise Pj( 2 .)(‘) is Gaussian, and f is log-space computable, 
smooth, and (for some i] > 0) satisfies \d^f{x)\ < klrf for all x, then the computation of the 
invariant measure y at precision 5 can be done in space O (poly (log 77 + log ^ + log log |)). 

Proof. In the proof of Theorem 1, we make the slight modification that instead of simply picking 
the regions a* to satisfy diama* < e, we instead make them satisfy the stronger requirement that 
diamoj < min(e, 1/(27/)). Then, by Theorem 13, we can compute all the necessary integrals in 
logarithmic space, as before by the earlier assumption. 

Since the number of regions is linear in 77 , the resulting space bound is 0 (poly(log 77 + log - + 
log log j)), as desired. □ 
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5 Space lower bound for computing invariant measures 


In this section we prove Theorem 3. We begin by proving a weaker version of Theorem 3 where we 
don’t restrict our constructed function / to be analytic (or even continuous). 

Lemma 16. Any algorithm that can compute the invariant measure p of a dynamical system to 
within precision 6 with Gaussian noise kernel requires space at least II (log ^ + log log |). 

Proof. Since our output is of size log it requires H (log log j) space to simply keep track of which 
bit we are currently outputting. This immediately shows the II (log log j) part of the lower bound. 

It remains to show the H (log i) portion of the lower bound. We will present a SPACE(log M)- 
reduction from SPACE{M) to the problem of computing the invariant measure of a noisy dynami¬ 
cal system with e = , thus showing computing the invariant measure of a noisy dynamical 

system requires space at least Il(logi). 

More specifically, we will show how to convert any Turing machine T with tape size M along 
with an input s into a function f : X ^ X that ‘embeds’ this machine/input pair. We will 
construct this embedding so that the invariant measure of the corresponding dynamical system will 
have significant measure on some subset of the domain X if T accepts s and close to zero measure 
otherwise. 

Let S be the total number of states of the Turing machine T (including the current state of the 
tape, so S = 0(2^)), and let N = 2S^. Choose X to be the unit interval [0,1], and partition X 
into the N intervals X^ = [;^, for 0 < k < N. Let Ck = be the center of interval X^. 

Choose e (the size of the Gaussian noise) so that = 1 — since the tail of a 

Gaussian decreases to 0 exponentially quickly, it suffices to take e = II(V“^) = (then this 

integral corresponds to the probability of being at least II(V) standard deviations away from the 
mean). 

Finally, if x € Xk, then we define / so that f{x) = Cgucc(fc)! where succ(A:) : {0,N — 1} ^ 
{0 ,... ,N — 1} is defined as follows. 

(i) If A: < 5^, set {v,t) = ([^J,A; — ^[^J). We will interpret V as the binary representation of 
some state of T, and I as a counter of how many steps we have run machine T for so far. 

(a) If V is an accepting state, set succ(A:) = 

(b) If u is a rejecting state, set succ(A;) = sS, where s is the initial state of the Turing 
machine T. 

(c) If I < 5 — 1 and v is neither an accepting or a rejecting state, find the successor state v' 
of V according to the Turing machine T (note that since computation is local, this can 
be done in space O(logM)), and set succ(/c) = v'S -|- (I -|- 1). 

(d) If I = 5 — 1, set succ(A;) = sS, where s is the initial state of the Turing machine T. 

(ii) If 5^ < A; < 2S‘^ — 1, then succ(A:) = A: -|- 1. 

(iii) If A: = — 1, then succ(A:) = where s is the initial state of T. 

Intuitively, this function / simulates the Turing machine T for up to S time steps (the maximum 
amount of time a Turing machine with S states can take to reach an accepting state). If, within 


16 



these S time steps, we encounter an accepting state, we go on a walk for another S time steps 
through [1/2,1] and then return to the initial state; otherwise, if we encounter a rejecting state 
(or run for S steps without accepting or rejecting), we immediately return to the initial state. In 
this way, if s is an accepting initial state, the invariant measure will have approximately half their 
weight on the interval [1/2,1], and if s is not an accepting initial state, the invariant measure will 
have approximately no weight on [1/2,1]. We formalize this intuition below. 

Let ^ be the invariant measure of this dynamical system perturbed by Gaussian noise of variance 
with e as chosen above (note that since the noise is Gaussian, there must be a unique invariant 
measure; this follows from the fact that for any set U of positive measure, the probability xt+\ G U 
given xt is always strictly positive). We claim that if T eventually accepts on s, then /r will have 
measure at least 1/3 on [1/2,1]. Otherwise, /i will have measure approximately 0 on [1/2,1]. 

Let S = {sS”, succ(s5), succ(succ(s5)),... } be the set of iterates of the initial state s of 
our Turing machine under this successor function. Note that if T accepts starting on s, then 
{S^,..., 28“^ — 1} is a subset of S; otherwise, if it rejects or fails to halt, then {5^,..., 25^ — 1} is 
not a subset of S. 

We first claim that the weight of the invariant measure ^ over states in S is at least 1 — N~^^. 
To see this, let xi,X 2 , • • • be a sequence of iterates of our dynamical system. Gall a time t bad if 
Xt G Xk but xt+i 0 X^^cc{k)- By our choice of e, the probability of any given time t being bad is 
at most and is independent of all other times being bad. In addition, by our construction, 

after N noise-free steps we are guaranteed to be in S, since after N steps of succ(A:) we must pass 
through sS. If we let Xs = UkesXk, it then follows that the probability that xt G Xs is at least 
(1 _ 7Y-i00)W >i_ Ar-99. 

Next, assume that T accepts on s, and let Xpath = = [1/2,1]; note that Xpath is a 

subset of Xs- We claim that the weight under the measure ^ of Xpath is at least ^(1 — 25“®) of 
the weight of Xs- To see this, call the sequence xt, xt+i,..., good if no time t + i is bad for 

any 0 < i < |5| (in other words, no low probability noise events occur for |5| steps). Note that this 
occurs with probability at least (1 — > 1 — But in any good sequence, each element 

of S appears exactly once; it follows that, asymptotically, the probability that xt belongs to Xpath 
given that xt belongs to Xs is at least 

(1 _ iV-®®) > (1 - = i(l - iV-®®) 

Gombining these two results, it follows that the weight of the invariant measure over Xpath is 
at least 


-(l-5-®®)2>i 
2 ^ ^3 

On the other hand, if T does not accept on s, then [1/2,1] n Xs = 0, and therefore the weight of 
over [1/2,1] is at most A^“®® <c 1/3, as desired. 

□ 

Note that, since the function constructed in this reduction is piecewise linear with 0{2^) pieces, 
it is in fact (loge“^) + log-space integrable in the sense of Theorem 1. On the other hand, this 
function is not continuous (let alone analytic), and hence does not satisfy the conditions of Theorem 
2 . 
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To prove Theorem 3, we transform the above example into a uniformly analytic function by 
replacing each of the intervals in the construction in Lemma 16 with an analytic approximation 
to a step function. We describe this below, starting with the construction of our analytic ‘step 
function’. 

Lemma 17. For any a, {3 >0, there exists an analytic function F{x) : M — )• M that satisfies the 
following constraints: 

• For all X < —a, |T(x)l < /3. 

• For all X > a, \F{x) — 1| < /3. 

• For all integer k >0 and all x, \d^F{x)\ < k\rf for some rj = 0(a~^ log/3“^). 

• The function F{x) is computable to within precision 5 in space 0(loglog5“^). 

Proof. We will consider functions of the form 

(«) 

where C is a positive integer. Note that in order for \F{x) — 1| to be less than /3 for all x > a, we 
must have 


1 + e-C'" 


- 1 


</3 


which is satisfied when 


C > a ^ log 


Likewise, in order for |T(x)| to be less than fi when x < —a, we must have 

1 


1 + 


<fi 


which is satisfied when 


C > a log 






Therefore to satisfy the first two requirements, we can take 

C = 


-m 1-/3' 

a log 


/3 


a ^ log fi ^ 


To prove the third requirement, note that we can write 

1 


F(x) = - ( 1 + tanh 


~Y 


By [AS65], it is known that (for x > 0), 
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tanh(x) 




2k+lg2x 

(1 + 


k-l 

E 

j=0 


< 


(1 + e2^)^+i ^ 


fc+i 


= 2 


< 2^+^fc! 


i=o 

2x \ k+1 


1 + e- 


2x 




fc! 


where are Eulerian numbers of the second kind (in the third line we use the fact that — 

n\). Since tanh(x) is an odd function, the same bound holds for x < 0. It follows that for all A: > 0 
and all x, 


\d’^F{x)\ < C'^kl 

and therefore we can take rj = C (for A; = 0, it suffices to note that |E(x)| < 1 for all x). 

Finally, since we can compute to within precision <5 in space O(log log <5“^) via Lemma 24, 
and since we can perform all arithmetic operations to within precision 6 in space 0(loglog(I“^) via 
the remarks in Section 2.2, it is possible to compute F{x) in space 0(loglog5“^). □ 


We now proceed to prove Theorem 3. 

Theorem 3. Any algorithm that can compute the invariant measure p. to within precision 5 of a dy¬ 
namical system with Gaussian noise kernel and analytic transition function f{x) (that uni¬ 

formly satisfies \d^f{x)\ < klrj^ for some rj = poly(e“^) j requires space at least Q (log ^ + log log |). 

Proof. We will use the function F{x) dehned in Lemma 17 to approximate the function /(x) dehned 
in the proof of Lemma 16 with an analytic function. We will then show that the dynamical system 
corresponding to this new / still has the property that it has signihcant measure on the interval 
[1/2,1] if and only if the Turing machine T accepts s. 

As before, let S = 2^ be the number of states of the Turing machine 
Partition the interval [0,1] into the N intervals = [;^, for 0 < A: < 
be the center of interval X^. Let succ(A;) be defined equivalently as in the 
Then, in Lemma 17, set a = (3 = and consider the function 


T, and let N = 25^. 
N, and let 

proof of Theorem 16. 


N 

/(x) CsuQ(;( 0 ) T ^ ^ (Csucc(i) Csuc;c(i— 1 )) F 

i=l 



(9) 


Note that by Lemma 17, this function / satisfies the following condition: if |x — c^j < ^ — a, 
then |/(x) — Csucc(fc)l ^ F'P = 0(5“®®). We will next claim that if we set e = 5“^®, then we 
simultaneously have that 


maxpg(x) < 

X 


gio 




( 10 ) 
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and that 


^ 2iV “ 


( 11 ) 


To show the first of these inequalities, note simply that Pe{x) < inequality 10 then follows 

from substituting e = . To show the second inequality, note first that ^ — ce — N(5 > 

Hence the integral in inequality 11 is at most the probability that the noise is within ^ 

standard deviations of its mean. By Chebyshev’s inequality it follows that this probability is at most 
1 — IbS”^®, from which this second inequality follows (much better bounds are in fact possible). 

We can now proceed to analyze the invariant measure p of this dynamical system. For each k, 
let Tfc = [ck — + ct, Ck + — a], and let Y = We will first show that p has measure 

at least 1 — 325“^® on Y. 

Let xi,X 2 , - ■ ■ be a sequence of iterates of this dynamical system. Call a time t bad if xt G Yfc 
but xt+i 0 By inequality 10, the probability that a time t is bad (given that xt G Yk for 

some k) is at most 16S“^®. It follows that Pr[xt+i 0 Y\xt G T] < IbS"”^®. On the other hand, note 
that if Xt 0 Y, then by inequality 11, the probability xt+\ is in Y is at least 


cio 

1 
2 

It follows that the weight of /r over Y must be at least 0.5/(0.5 + 165“^®) > 1 — 325'“^®, as desired. 

Next, as before, let S = {sS*, succ(s5), succ(succ(s5)),... } be the set of iterates of the initial 
state s of our Turing machine. If T accepts starting on s, then {5^,..., 25^ — 1} is a subset of S\ 
otherwise, if it rejects or fails to halt, then {5^,..., 25^ — 1} is not a subset of S. Let Ys = Ufcg^Yfe. 
We will next show that the weight of p over I 5 is at least 1 — 

To prove this, recall that if we start at some x €Y, after N noise-free steps, we are guaranteed 
to be in I 5 . Since the weight of p over Y is at least 1 — 32S“^® and since the probability a string 
of N steps are all good is at least (1 — > 1 — 325'“^'^, the weight of p over I 5 is at least 

(1 - 325-^6)(1 - 325 - 1 ^) > 1 - 645-^1 

Finally, assume that T accepts on s, and let Ypath = = [1/2,1]; note that Ypath is a 

subset of I 5 . We claim that the weight under the measure p of Ypath is at least ^(1 — 325'“^^) of 
the weight of I 5 . To see this, call the sequence Xt-,Xt+i,... good if no time t -|- i is bad for 

any 0 < i < |5|. Note that this occurs with probability at least (1 — 165'“^®)'^ > 1 — 325'“^^. But 
in any good sequence, each element of S appears exactly once; it follows that, asymptotically, the 
probability that xt belongs to Xpath given that xt belongs to Xs is at least 

(1 - 325-i")j^ > (1 - 325-14)^ = i(l - 325-14) 

Combining these two results, it follows that the weight of the invariant measure over Ypath (and 
hence [1/2,1]) is at least 


1 — ^maxpg(x)^ |X \ y[ > 

> 

> 
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Zi O 

On the other hand, if T does not accept on s, then [1/2,1] n I 5 = 0, and therefore the weight of fi 
over [1/2,1] is at most <C 1/3, as desired. 

□ 
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A Space-Bounded Church-Turing Thesis 

This appendix serves as a short introduction to the Space-Bounded Church-Turing thesis (hereafter 
referred to as the SBCT). For more further details, we suggest the reader consult [BRS15]. 

Let S = Xt he a closed, discrete-time stochastic system over a state space A. We define the 
memory available to S as 


M{S) = sup sup Ixtr.t,{Xt-,Xt+i) (12) 

t fl 

where the inner supremum is taken over all distributions p, over A. Here, I (Xt] Xt-\-i) is Shannon’s 
mutual information and is a measure of how much information is preserved from time t to time 
t J- 1. If f(x,y) is the PDF of the distribution of {Xt,Xt+i), then I(Xt]Xt+i) is defined via 

I{Xt]Xt+i) = J j f(x,y)\ogj^3j^dxdy 
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We can now state a concrete form of the SECT (in [BRS15], the statement below is referred to 
as the Simulation Assertion). 

Conjecture 18. The problem of computing the asymptotic behavior of a stochastic system S with 
memory M = A4(S) to within precision 2“"' is in the complexity class SPACE((M + log 

While this conjecture can be easily falsified by an artificial construction, a case can be made 
that it holds for physically relevant systems. In particular, the present paper establishes Conjecture 
18 in the case where 5 is a dynamical system with e-Gaussian noise. 

Lemma 19. If S is a dynamical system over X = [0,1] with Gaussian noise kernel then 

M(S) = 0(loge-i). 

Proof. We can write I{Xt]Xt+i) as H{Xt+i) — H{Xt+i\Xt) (here H{x) is the differential Shannon 
entropy; see [CT91]). Let p{x) be the PDF of Xt+i] by Jensen’s inequality, note that 

H{Xt+i) = - p{x)logp{x)dx < - 
Jo 

On the other hand, Xt+i\Xt is a Gaussian with variance e; the differential entropy of such a 
distribution is given by ln(e\/27re) ([GT91]). Hence it follows that I{Xt]Xt+i) < — ln(e\/2vre) = 
O(logi). □ 

Corollary 20. Conjecture 18 is true for the case where S is a dynamical system over X = [0,1] 
with Gaussian noise kernel ■ 

B Exponentiating numbers to high powers 

In 1984, Alt showed how to compute x® (for potentially exponentially large E) in logarithmic space 
by computing exp(Elogx) (see [Alt84]). For completeness, we include in this appendix a proof of 
Alt’s result. Formally, we prove the following theorem: 

Theorem 21. Given a positive real number x presented in binary up to precision 2 “P°h("') and an 
integer exponent E < 2 P°h(«)^ there exists an algorithm that computes y = in space O(logre) to 
within precision 2“P°^^(”) if y if‘IE (and otherwise reports that y > 2^). 

Remark 22. The condition that we only output y to within precision 2 “P°h(’^) if y < 2” is crucial. In 
general, it is possible for x® to be on the order of 2 ^'’°'^^'"', and hence require an exponential number 
of bits in n to represent to within precision 2 “P°h(^). However, a machine with space O(logn) only 
has poly(n) different states, and hence cannot hope to output a binary string of length exponential 
in n. 

Instead of proceeding via iterative squaring, our algorithm first calculates logx to sufficient 
precision, then computes exp(£'logx). To do this, we hrst show that we can compute the functions 
log(x) and exp(x) in logarithmic space. 

Lemma 23. Given a positive real number x up to precision 2 “P°h("')^ can compute logx to 
within precision 2 “P°*y^”) in logarithmic space. 


I log Idx = 0 
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Proof. First, find an integer v such that G [1,1.5); if I is the length of the binary represen¬ 

tation of the integer part of x, then we can choose v to be either 2 £ or 2i + 1. Set w = 

Then log x = log w + v log y/2. 

We have therefore reduced the problem to computing the logarithm of numbers within the range 
[1,1.5]. To do this, we will use the following Taylor expansion for log(l -|- z): 

CO ^ 

1o6(1 + 2) = 5;(-1)‘+1^ 

k=l 

Since z = x — 1 < ^ for all x in [ 1 ,1.5], the error from truncating after k terms is at most 2 “^. 
Therefore, to evaluate log(l -t- z) to within precision suffices to sum the first poly(n) 

terms of this series. By comments f and g of section 2.2, each of these terms can be evaluated in 
logarithmic space, and hence logx can be computed to within precision 2 “” in space O(logn), as 
desired. □ 

Again, when computing exp(x) in logarithmic space, we must ensure the output does not have 
length exponential in n. For that reason, we restrict ourselves to computing exp(x) for values 
X < poly(n). 

Lemma 24. Given a positive real number x < poly(n) to within precision we can compute 

exp(x) to within precision 2 “^°^^^”^ in logarithmic space. 

Proof. Again, we will compute exp(x) via its Taylor expansion, truncating after a suitable number 
of terms. Recall that the Taylor expansion of exp(x) is given by 

CXD U 

exp(x) = 

k=0 

By Lagrange’s remainder theorem for Taylor series, we have that the error from truncating after 
k terms is at most 


exp(x)x^’''^ 

{k + l)l 

Choosing k on the order of x^^ (for some constant d) and applying Stirling’s approximation, 
guarantees that this error is at most 2“”'*; it therefore follows that to evaluate exp(x) to within 
precision it suffices to compute and sum a polynomial number of terms of this series, 

which can be done in logarithmic space. □ 

We now proceed to prove Theorem 21. 

Proof of Theorem 21. As mentioned earlier, we will compute x® by computing exp(i?logx). Since 
gX+tS _ gX jgX^ computing exp(Fllogx) to within precision e requires us to compute FI logx to 
within precision b = ej exp(Fl log x). Since we need only do this in the case where exp(Fl log x) <2"', 
we can assume b < e2“” = 2“^°^^^"'^ 

Hence, first compute (via Lemma 23) E log x to within this required precision, and then compute 
(via Lemma 24) exp(FHogx) to within the desired precision. (If FHogx > nlog2, then we can 
instead output that y > 2”). This completes the proof. □ 
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Finally, we demonstrate how to extend this result to the case of computing powers of complex 
numbers. Our approach will be similar, except we will consider the phase and amplitude separately. 
We first show it is possible to compute (in logarithmic space) the argument of a complex number. 

Lemma 25. Given a complex number z = x + yi where x and y are given to precision we 

can compute arg(z) to within precision in logarithmic space (where arg(z) is the unique 

9 G [0, 27r) such that z = Me*® for some positive real M). 

Proof. Set u = exp(—27ri/6). Find an integer v G [0,12) such that P = uPz has an argument in 
the range [0, arctan(0.6)) (note that tt/S < arctan(0.6)). Then, arg(z) = arg( 2 ;') — 

We therefore wish to compute arg(z) for those z whose argument lies in [0,0.6). For these z, 
arg( 2 :) = arctan (|), so it therefore suffices to compute arctan(t) for t G [0,0.6). 

As before, we will proceed via evaluating the Taylor series of arctan(t). Recall that the Taylor 
expansion of arctan (t) is given by 


oo ^2k-\-l 

arctan(t) = ^(-l)^ 2 fc + i 
k=0 

By comparison to a geometric series, the error from truncating after k terms of this series is 
at most 2“^. Therefore, to evaluate arg(z) to within precision it suffices to sum the first 

k = poly(n) terms of this series. Since computing each term can also be done in logarithmic space, 
it is therefore possible to compute arg( 2 ;) to within this precision in logarithmic space. □ 

Theorem 26. Given a complex number z = x + iy with x and y presented in binary up to precision 
2 “P°b("-) and an integer exponent E = 0(2^°*^^”')), there exists an algorithm that computes P = z^ 
in space O(logn) to within precision 2“P°h(’^) if\z'\ < 2"' (and otherwise reports that \z'\ > 2"'J. 

Proof. Write z = re*^, with r = and 6 = arg(z). Note then that z' = 

Given x and y, we can compute r to any desired precision 2“^°'^^"') and hence by Theorem 
21, we can also compute r^ to within any precision 2 “P°A(’^) [a,s long as r® < 2”). Likewise, by 
Lemma 25, we can compute 9 to within any precision 2“P°A("-), and hence compute 9' = E9. By 
reducing 9' modulo 27r so that 9' G [0,27r), we can then compute exp(i0') = cos{9') + isin(0') to 
within precision 2“P°A(’^) via similar logic to Lemma 24. 

Finally, these two pieces let us compute z' = r^exp(i0') to within precision 2“P°A("'), as desired. 

□ 
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